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Abstract. The radiochemical experiment in the Homestake mine was designed to measure the solar neutrino flux 
through the detection of 3 ' Ar produced in the reaction v e + 37 Cl — > e~ + 3 ' Ar. The comparison between this 
measurement and the theoretical predictions from solar models evidences a substantial disagreement. I reanalyzed 
the data evidencing a bias with high statistical significance and suggesting a new interpretation of the data. 
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1. Introduction 


The Homestake chlorine experiment has been running for 
over 20 years providing measurements of a portion of the 
solar neutrino flux. A detailed description of the experi¬ 
mental apparatus and of the analysis is given in [Cleveland 
|et al.| (1998) and Bahcall (1989). 

Briefly, the experiment consists of about 133 tons of 
37 Cl in the form of C 2 CI 4 located in a tank in the 


atic errors and a global likelihood analysis is applied to 
all of them. The final measurement of the neutrino flux is 
based on this latter analysis and therefore on data that are 
not presented in detail. Nevertheless, as several papers in 


m tne torm or 02014 located m a tame m 
Homestake mine. The solar neutrinos induce the reaction 
v e A 37 Cl —> e ~+ 37 Ar and the resulting 37 Ar is extracted 
and put into proportional counters, that measure energy 
and timing of each decay. The 37 Ar atoms are counted 
observing the 2.82 keV Auger electrons from the electron 
capture with a half life of 35.04d. The Auger electrons are 
selected by appropriate cuts on the rise time and selecting 
an energy window around the peak. 

A run results in a time series of decays that is fit to the 
exponential decay of 3 ‘ Ar plus a decaying background. 
The 37 Ar production rate and the background level in 
each run are obtained by maximizing the probability of 
obtaining the given time series with a maximum likeli¬ 
hood technique optimized for low counting rate [Cleveland 
et al. (1983). The same fit gives the ”lcr” errors on rate 
and background interpreted as 68% confidence range. The 
results are presented separately for each run, that covers 
approximately two months of data taking. 

In Cleveland et al. ( 1998| ) the data are presented as in the 
past (see Bahcall ( 1989| ) ) in a list of single run analysis 
obtained using tight cuts on the rise time and on the en¬ 
ergy window to optimize the dominant statistical error. 
Alternatively the data are selected with loose cuts on the 
rise time and on the energy window to reduce the system- 


t he pa st Bahcall et al. ( 1987 )- Bieber et al.| ( 1 990 j)-; Krauss 
( |l990| )- Filippone fe Vogel ( 199C )- Morrisonf ( |l992|) and ref¬ 
erences therein, the single run results will be reanalyzed 
to verify their consistency with the hypothesis of constant 
flux. 

This hypothesis has been challenged in the past by claims 
that the flux is correlated with the sun spot number or 
other parameter of solar activity or noticing that the fluxes 
measured over different time intervals differ significantly. 
I performed the hypothesis testing of constant flux by cal¬ 
culating the x 2 of If 16 distribution after having estimated 
the run errors. That results in a very poor agreement with 
the original hypothesis. 

Error independent analysises were also performed using 
the Smirnov and Kolmogorov hypothesis tests and using 
rank order statistic analysises. 

These show the signal and the background levels to be 
strongly correlated. 

Furthermore the average fluxes and the hypothesis tests 
are calculated separately for different partitions of the run 
set according to the background level rank. The result 
strongly suggests a bias in the data and a very bad \ 2 for 
the high background subset. 

Based on this analysis, I give an alternative higher esti¬ 
mation of the flux. 
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2. Data and errors 


The d a ta an alyzed in this paper are presented in [Cleveland 
et al. ( 1998 ) for N T = 108 runs in the following format: 
run start time, run stop time, run average time (account¬ 
ing for the decay of 37 Ar in the detector), production 
rate of 37 Ar resulting from the fit in atom per day, lower 
” Ict” error ( 68 % confidence range) on the production rate, 
higher ”lcr” error ( 68 % confidence range) on the produc¬ 
tion rate, counter background resulting from the fit in 
count per day, lower ”lcr” error ( 68 % confidence range) 
on the counter background, higher ”lc” error ( 68 % confi¬ 
dence range) on the counter background. These data are 
presented in Fig.^jfor the production rate and in Fig.||for 
the background. 

In testing an hypothesis (for example production rate p 
constant), it is necessary to assign an error to the rate 
measurement of each run. Previous analysis have devised 
different estimation of errors: equal on all runs, implicit 
in using rank-order statistic Bahcall fe Press| (199l[), av- 


erage of lower and higher errors 3ahcall et al 
the larger of the two Bahcal] (1989)-Bieber et al 


(1987), 

(1990)- 


Krauss (|199C|), calculated by rate and rate errors Filippom 


fc Vogel ( 1990| ). All these estimations seems incorrect. 

It is just the case to recall that if a measurement of a 
Poisson process of average p is n, the error is yjp and not 
-v Jn . This last estimation is approximately correct only for 
large n, that is in gaussian approximation. If only a single 
measurement is available, the best estimation of p is n 
and the two approaches coincide, but if several measure¬ 
ments are available, giving an estimation p of p 7 the best 
estimation of the errors on the single measurement is yfp. 
Furthermore, the combination of a Poisson process of av¬ 
erage p and a binomial distribution due to an efficiency 
e is a Poisson process of average pe. The errors on the 
production rate p reported by the fit refer to the esti¬ 
mation of p in the single run, while the best estimation 
p makes use of all runs. Therefore, in absence of back¬ 
ground counts, zero non-solar neutrino production rate 
and equal efficiency e for each run, the average counts ex¬ 
pected in run i of duration Ati are n, = peAt e J*, where 
A t e /f = (1 — exp(—^-)) ts7 Ar is the effective run time 

accounting for 37 Ar decay. Its error is ^fnl = \jpeAt e /^ 

that gives an error on the rate <r(p) = ■ [Cleveland 

et al. ( 1983 ) reports that the efficiency is constant and 
equal to e = 0.95 x 0.96 x 0.327 = 0.298. 

If the non-solar neutrino rate is non-zero, the Poisson pro¬ 
cess under measurement is the sum of two distinct Poisson 
processes, the solar neutrino rate, ps , and the non-solar 
neutrino rate Pi,NSi that is supposed known. The mea¬ 
surement gives an estimation pj = p$ + Pi,NSi from which 
Ps = P — Pns (the t rue known value of p^s is used). That 
gives of (ps) = g 2 (p) + ctUp ns) = o- 2 (p s ) + 2 a 2 {p NS ), 

that is (Ti{ps) = ^ t P eif S ■ 

In presence of background, the signal can be identified 


from the decay time of the atoms. Following a hint in 
Bahcall (1989), the background measurement stems from 
the counting in the signal free region at time larger com¬ 
pared to T 37 Ar, while the signal is measured from the 
counting within a few T 37 a t . The statistical effect of the 
background can be evaluated expressing the background 
in term equivalent to signal rate and then applying the 
same approach used for the non solar rate component. 
The background induced rate is pbk = > where bi is 


the background rate. The underlying assumption is that 
the contribution of the background to the signal measure¬ 
ment is concentrated in T 37 

With these definitions the total error on the solar rate is 


ch:(ps) 


ps + 2pi,NS+'2pi,bk 

TKiP 1 ' 


It is apparent that, in order to use this errors to calculate 
anything relevant, an estimation of the rate must be al¬ 
ready available. That is obtained through the unweighted 
average (actually weighted only through the effective run 
time length) 


Puw Ei At i ff 

The weighted average is instead obtained as 


( 1 ) 


P = 


Ei 


Ei 


PiAt‘ ff 
A t eff 


( 2 ) 


3. Hypothesis testing 


The hypothesis of constant flux or, more generally, of con- 
sistency of the data set can be t ested in several ways 
Frodesen et al. ( 197S )- Eadie et al. ( |l971 ). 

The Pearson’s y 2 method quantifies the consistency of the 
constant flux hypothesis making use of the errors on the 
full sample, calculating 


X' 


\N~1) = J2 


Pt-P 


( 3 ) 


where N is the number of data points. 

Partitioning the data set in two subsamples A and B, the 
same approach can be repeated on each subsample as long 
as the estimator p is replaced by the subsample estimator 
p A ( B \ A test of compatibility between the two samples is 
obtained estimating the probability of the difference be¬ 
tween the two estimators. 

The disadvantage of this approach is that it strongly relies 
on the exact evaluation of the errors. As discussed previ¬ 
ously the presence of background creates some ambiguity 
in defining them. 

Alternatively we can assume that all the data have 
the same weights and employ the hypothesis test of 
Kolmogorov and Smirnov. The test requires to order 
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the N observation on the variable p in ascending order 
(xi • • -Xn) and build their cumulative distribution 

{ 0 x < X\ 

£ Xi < X < Xj+l 

1 x > xn 

This distribution is compared with the cumulative dis¬ 
tribution function F(x) occuring for a Poisson process at 
the constant rate determined through the weighted aver¬ 
age. In principle the test is applicable only if the compar¬ 
ison function has no parameter determined by the data 
themselves; that is not our case but it is fair to assume 
that one parameter determined out of a large data sample 
will have little influence on its outcome. 

The distance between the experimental and theoretical cu¬ 
mulative distributions is calculated in two different met¬ 
rics: the first, named after Kolmogorov, is 

Dn = max x \SN(x) - F(x )|, 


the second, named after Smirnov, is 

/ +oo 

(S N (x) - F(x)) 2 dF(x). 

-OO 

The distribution functions of D^y/lX and W 2 N can be 
calculated and are available as analytical formulae, tables 
or recurrence relations implemented by library routines. 
The same approach and the same formulae allows the com¬ 
parison between two set of data to verify if they come from 
the same original distribution. The distance between the 
cumulative distributions of two samples of size M and N 
is, after Kolmogorov, 


D N m = max x \S N (x) - Sm^)! 

where Dnm\/MN/(AI + N) has the same probability 
distribution as D^y/N and, after Smirnov, 


W$/m — 


p + OO 


( S N (x ) - S M (x)) 2 d 


NS n (X) + MS m (X) 


N + M 


where W^ m (MN/(M + N)) has the same probability dis¬ 
tribution as W^N 2 . The test is exact because there are 
no estimated parameters. 


3.1. The full data sample 

The previous hypothesis tests applied to the full data sam¬ 
ple under the hypothesis of constant flux give the following 
probabilities: 


P(y 2 ) = 0.33% 
P{D n ) = 3.62% 
P(W%) = 2.79% 


4. Rank measure of association 


Another approach to the data is studying the correlation 
between the measured production rate in each run and 
another run dependent quantity. The candidate quantities 
are the run epoch, the effective run time length and the 
background rate. Alternatively any quantity defined ver¬ 
sus the time epoch can be used. In Bahcall & Press (1991), 
Krauss| (1990), Bieber et al. ( 199C| ) several sun’s activity 
dependent quantities are used, e.g. the mean sunspot num¬ 
ber. 

An estimation of correlation can be done using the stan¬ 
dard correlation coefficient as in Krauss| (1990) according 
to Press et al. (1986) or using rank-order statistic as in 


Bahcall & Press (1991). The latter is more robust because 


it makes no assumption on the underly ing distributions 
nor on the data errors. As suggested in 


(1991) and described in Press et al. (1986) the rank statis- 


Bahcall & Pres: 


tical tests of Spearman rank-order correlation coefficient 
and Kendall’s r are applied. 

The principle is that if two quantities labelled by run in¬ 
dex are rank ordered, the more uniform is the resulting 
scatte r plot the less the two quantities are correlated. 

In Fig. 3(a) , the background rate is plotted versus the 37 Ar 
production rate, as well as the corresponding rank ordered 
quantities. The rank ordered plot shows a denser diagonal 
band suggesting a significant anticorrelation. 


4.1. The full data sample 

The Spearman and Kendall tests are applied to the full 
data sample measuring the correlation between the 37 Ar 
production rate and the run epoch (f r ), the effective run 
time length (t e //) and the background rate (back). The 
result is expressed as the probability to obtain the ob¬ 
served correlation parameter from two uncorrelated dis¬ 
tributions: 



W/ 

t r 

back 


Spearman r s 

4.95% 

2.09% 

9.5 10~ : 

3 % 

Spearman D 

5.15% 

2.04% 

14.310- 

- 3 % 

Kendall 

5.76% 

1 .88% 

14.910- 

-3% 


From this table it is apparent that there is little (auto¬ 
correlation between the Argon production rate and f e //, 
a little more with the run epoch and a very strong one 
with the background level, that is visible in Fig. |3(b)| . This 
correlation has no physical justification and has never been 
noticed before. Its strength is comparable to the strongest 
correlation identified in previous papers Kraus:) (1990), 
Bahcall & Press (1991), Bieber et al. ( |1990| ). 

It suggests that the analysis algorithm decreases the signal 
in presence of background and therefore that the average 
production rate is underestimated. 


5. Partitioning the data set 


The smallest one (y 2 ) has the limit of relying heavily 
on a delicate procedure of error estimation. The others are 
more robust but not small enough to be conclusive. 


The previous results provide strong suggestions that the 
37 Ar production rate is anticorrelated with the back¬ 
ground and that the overall consistency of the data under 
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Fig. 1. Solar neutrino rate with the Homestake experiment on a run basis. 



1 FWHM Results 


Year 


Fig. 2. Background in the Homestake chlorine experiment on a run basis. 


the hypothesis of constant flux is poor. Partitioning the 
data set in two subsets might help to gain more under¬ 
standing on this disagreement. 

To limit the arbitrariness of the partition, we consider the 
run set ranked according to the background rate and, for 
each number 1 < TV < Nt, the partition in the two sets 
including the runs with the N lower background rate and 
the runs with the Nt — N higher background rate. That 
gives Nt — 1 partitions in two sets. 

For each set in each partition the average production rate 


is recalculated and the previous hypothesis tests are ap¬ 
plied as well as the rank order tests. The hypothesis tests 
are applied also on the pair of sets of each partition to 
estimate the probablity that they come from the same 
consta nt distribution . 

In Fig. 4(a)|j4(b) - 1(c) - 4(d) the test and rank probabilities 
are plotted versus the background rate ranked run num¬ 
ber. In Fig .|| the probabilities of low and high background 
ranked runs coming from the same constant distribution 
are plotted and in Fig.|| the average 37 Ar production rate 
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(a) (b) 

Fig. 3. a) Background rate versus 37 Ar production rate b) Background rate rank versus 37 Ar production rate rank. 
In the null hypothesis of no correlation, the points should be uniformly distributed. 


for the two sets is shown. 

What is apparent is that when the hypothesis and the 
rank tests are restricted to the lower two third of the runs 
(about 70), the experimental data are coherent with the 
hypothesis of constant 37 Ar production rate and the av¬ 
erage value is constant with the run rank cut. Also the 
upper third of the runs, albeit less clearly, is a data set 
coherent with the hypothesis of constant 37 Ar production 
rate even if the average 37 Ar production rate depends on 
the run rank cut. 

The plot in Fig.|| demonstrate that the probability that 
the two complementary sets belong to the same distribu¬ 
tions reaches a minimum around a value of 70. 

The most natural interpretation is that the low and 
high background sets come from two distinct populations 
with different averages. The low background population 
is highly coherent and unbiased and therefore gives the 
most reliable estimate of the 31 Ar production rate. The 
high backgorund population is less coherent, as it is to 
be expected if the measurement of the production rate is 
biased by the background, and does not provide a clear 
measurement of the average. 


6. Conclusion 

The conclusion from the previous results is that the analy¬ 
sis on the data from the Homestake experiment should be 
restricted to the subset of about two third of the runs with 


low background. The runs with large background should 
be discarded. 

The somehow arbitrary choice of the cut in the back¬ 
gorund rank adds a small uncertainties to the estimation 
of the 3 ‘ Ar production rate. Choosing N = 70, using the 
weighted average as estimator and retaining the same sys¬ 
tematic error of the original paper, the result is 

0.566 ± 0.030 (statistical) ± 0.030 (systematic)day -1 

or 

3.03 ± 0 A6(statistical) ± 0 A6(systematic)SNU 

That is larger of almost three statistical stand a rd de via- 
tion than the original result in Cleveland et al. (1998) 
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(c) (d) 

Fig. 4. a) Probability of constant 37 Ar rate of the lowest N background runs for Smirnov (solid), Pearson (dashed) 
and Kolmogorov (dotted) b) Probability of constant 37 Ar rate of the highest Nt — N background runs for Smirnov 
(solid), Pearson (dashed) and Kolmogorov (dotted) c) Probability of no correlation between 3 ' Ar rate and background 
of the lowest N background runs for Kendall (solid), Spearman D (dashed) and Spearman r s (dotted) d) Probability 
of no correlation between 3 ' Ar rate and background of the highest Nt — N background runs for Kendall (solid), 
Spearman D (dashed) and Spearman r s (dotted) . 
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Fig. 5. Probability of constant and equal 37 Ar rate between the lowest N and highest Nt — N background runs for 
Smirnov (solid), Pearson (dashed) and Kolmogorov (dotted) 
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Fig. 6. Weighted average of the 37 Ar rate of the lowest N (solid) and highest Nt — N (dashed) background runs 
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